What Is Claimed Is: 

\1 . A method for facilitating use of a collation element that supports a 

2 large number of characters, comprising: 

3 receiving the collation element; 

4 reading a primary weight value from a primary weight field within the 

5 collation element; 

6 if the primary weight value falls within a reserved set of values, reading an 

7 additional portion ortiie primary weight value from a secondary weight field and a 

8 tertiary weight field within the collation element; and 

9 if the primary weight value is not within the reserved set of values, 

1 0 reading a secondary weight value from the secondary 

1 1 weight field within the collation element, and 

12 reading a tertiary weight value from the tertiary weight field 

1 3 within the collation element. 

1 2. The method of claim 1, wherein if the primary weight value falls 

2 within a reserved set of values, the methodVdditionally comprises: 

3 setting the secondary weight value to \\ secondary default value; and 

4 setting the tertiary weight value to a tertiary default value. 

1 3. The method of claim 1, wherein the collation element adheres to a 

2 structure specified in Unicode Technical Report No. l(k 

1 4. The method of claim 1 , \ 

2 wherein the primary weight value identifies a character^ 
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3 \ wherein the secondary weight value can specify an accent on the character; 

4 and \ 

5 wherein the tertiary weight value can specify case information for the 

6 character. \ 

1 5. Th^ method of claim 1, wherein the collation element is four bytes 

2 in size, of which thetorimary weight field is two bytes, the secondary weight field 

3 is one byte and the tertiary weight field is one byte, unless a value in the primary 

4 weight field belongs to t^e reserved set of values, in which case the primary 

5 weight field takes up all fotor bytes of the collation element. 

1 6. The method of claim 5, wherein the reserved set of values for the 

2 primary weight value includes hexadecimal values OxFFFO-OxFFFF. 

1 7. The method of claim l\ wherein the collation element is taken from 

2 a collation weight table that is used to map characters to collation weights in order 

3 to establish an ordering between strings o^characters. 

1 8. The method of claim 7, furthencomprising constructing a sorting 

2 key for a string by: \ 

3 reading each character in the string; \ 

4 looking up a corresponding collation element for each character from the 

5 collation weight table; and \ 

6 adding the corresponding collation element for each character to the 

7 sorting key. \ 

1 9. The method of claim 8, \ 
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2 \ wherein the sorting key is associated with a record within a database; and 

3 \ wherein the sorting key is used to construct a linguistic index for the 

4 database. 

1 10 A A computer-readable storage medium storing instructions that 

2 when executedsby a computer cause the computer to perform a method for 

3 facilitating use ora collation element that supports a large number of characters, 

4 the method comprising: 

5 receiving the collation element; 

6 reading a primary Veight value from a primary weight field within the 

7 collation element; \ 

8 if the primary weight value falls within a reserved set of values, reading an 

9 additional portion of the primary height value from a secondary weight field and a 

10 tertiary weight field within the collation element; and 

11 if the primary weight value is not within the reserved set of values, 

12 reading a secondarWeight value from the secondary 

13 weight field within the collatW element, and 

14 reading a tertiary weighrwalue from the tertiary weight field 

1 5 within the collation element. \ 

1 11. The computer-readable storage medium of claim 10, wherein if the 

2 primary weight value falls within a reserved set of value^, the method additionally 

3 comprises: \ 

4 setting the secondary weight value to a secondary denmlt value; and 

5 setting the tertiary weight value to a tertiary default valu^. 
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\ 12. The computer-readable storage medium of claim 10, wherein the 
collation element adheres to a structure specified in Unicode Technical Report 
No. 10\ 

13. \ The computer-readable storage medium of claim 10, 
whereimthe primary weight value identifies a character; 

wherein the secondary weight value can specify an accent on the character; 
and \ 

wherein the tertiary weight value can specify case information for the 
character. \ 

14. The computeAreadable storage medium of claim 10, wherein the 
collation element is four bytes m size, of which the primary weight field is two 
bytes, the secondary weight field ifc one byte and the tertiary weight field is one 
byte, unless a value in the primary weight field belongs to the reserved set of 
values, in which case the primary weight field takes up all four bytes of the 
collation element. \ 

15. The computer-readable storageSmedium of claim 14, wherein the 
reserved set of values for the primary weight vame includes hexidecimal values 
OxFFFO-OxFFFF. \ 

16. The computer-readable storage mediunkof claim 10, wherein the 
collation element is taken from a collation weight table that is used to map 
characters to collation weights in order to establish an ordering between strings of 
characters. \ 
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1 \ 17. The computer-readable storage medium of claim 16, wherein the 

2 method further comprises constructing a sorting key for a string by: 

3 reading each character in the string; 

4 looRmg up a corresponding collation element for each character from the 

5 collation weight table; and 

6 adding tne corresponding collation element for each character to the 

7 sorting key. \ 

1 18. The computer-readable storage medium of claim 1 7 5 

2 wherein the sorting key is associated with a record within a database; and 

3 wherein the sorting ltey is used to construct a linguistic index for the 

4 database. \ 

1 1 9. An apparatus that facilitates use of a collation element that 

2 supports a large number of character^ comprising: 

3 an assignment mechanism that\s configured to read a primary weight 

4 value from a primary weight field withiiMhe collation element; 

5 wherein if the primary weight valuk falls within a reserved set of values, 

6 the assignment mechanism is configured to read an additional portion of the 

7 primary weight value from a secondary weightVield and a tertiary weight field 

8 within the collation element; and \ 

9 wherein if the primary weight value is not within the reserved set of 

10 values, the assignment mechanism is configured to, \ 

1 1 read a secondary weight value from the secondary weight 

1 2 field within the collation element, and to\ 

1 3 read a tertiary weight value from thk tertiary weight field 

14 within the collation element. \ 
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1 \ 20. The apparatus of claim 19, wherein if the primary weight value 

2 falls wWiin the reserved set of values, the assignment mechanism is configured to: 

3 serthe secondary weight value to a secondary default value; and to 

4 set theNertiary weight value to a tertiary default value. 

1 21 . The apparatus of claim 19, wherein the collation element adheres 

2 to a structure specifiedvin Unicode Technical Report No. 10. 
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22. The apparatus of claim 19, 

wherein the primary weight value identifies a character; 

wherein the secondary weight value can specify an accent on the character; 



and 



wherein the tertiary weight vahae can specify case information for the 
character. 

23. The apparatus of claim 19, wherein the collation element is four 
bytes in size, of which the primary weight fielcrus two bytes, the secondary weight 
field is one byte and the tertiary weight field is one byte, unless a value in the 
primary weight field belongs to the reserved set of wilues, in which case the 
primary weight field takes up all four bytes of the collation element. 



1 24. The apparatus of claim 23, wherein the reserved set of values for 

2 the primary weight value includes hexidecimal values OxFFFO-OxFFFF. 
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\ 25. The apparatus of claim 19, wherein the collation element is taken 
from a collation weight table that is used to map characters to collation weights in 
order to establish an ordering between strings of characters. 

26. THe apparatus of claim 25, further comprising a key construction 
mechanism for constructing a sorting key for a string, wherein the key 
construction mechanism is configured to: 

read each character in the string; 

lookup a corresponaing collation element for each character from the 
collation weight table; and to\ 

add the corresponding conation element for each character to the sorting 
key. \ 

27. The apparatus of claim 2(k 

wherein the sorting key is associated! with a record within a database; and 
wherein the sorting key is used to construct a linguistic index for the 



database. 
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